
These rare, giant millipedes only exist in Florida

Popular Science

When a graduate student found a baby Florida scrub millipede, she put it in a kiddie pool. Then it got busy reproducing. While Florida is perhaps best known for its beaches and wetlands, its landscape hosts other notable features: ridges. Millions of years ago, sea levels were higher than they are today, and these elevated areas of land became like islands.


'People thought I was a communist doing this as a non-profit': is Wikipedia's Jimmy Wales the last decent tech baron?

The Guardian

'People thought I was a communist doing this as a non-profit': is Wikipedia's Jimmy Wales the last decent tech baron? In an online landscape characterised by doom and division, the people's encyclopedia stands out - a huge collective endeavour giving everyone free access to the sum of human knowledge. But with Elon Musk branding it 'Wokipedia' and AI looming large, can it survive? Wikipedia will be 25 years old in January. Jimmy Wales's daughter will be 25 and three weeks. It's not a coincidence: on Boxing Day 2000 Wales's then wife, Christine, gave birth to a baby girl, but it quickly became clear that something wasn't right. She had breathed in contaminated amniotic fluid, resulting in a life-threatening condition called meconium aspiration syndrome. An experimental treatment was available at the hospital near where they lived in San Diego. Did they want to try it?


Discrepancy Detection at the Data Level: Toward Consistent Multilingual Question Answering

Calvo-Bartolomé, Lorena, Aldana, Valérie, Cantarero, Karla, de Mesa, Alonso Madroñal, Arenas-García, Jerónimo, Boyd-Graber, Jordan

arXiv.org Artificial Intelligence

Multilingual question answering (QA) systems must ensure factual consistency across languages, especially for objective queries such as "What is jaundice?", while also accounting for cultural variation in subjective responses. We propose MIND, a user-in-the-loop fact-checking pipeline to detect factual and cultural discrepancies in multilingual QA knowledge bases. MIND highlights divergent answers to culturally sensitive questions (e.g., "Who assists in childbirth?") that vary by region and context. We evaluate MIND on a bilingual QA system in the maternal and infant health domain and release a dataset of bilingual questions annotated for factual and cultural inconsistencies. We further test MIND on datasets from other domains to assess generalization. In all cases, MIND reliably identifies inconsistencies, supporting the development of more culturally aware and factually consistent QA systems.
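The core idea of data-level discrepancy detection can be illustrated with a minimal sketch, not the authors' MIND pipeline: given pairs of aligned answers (here assumed already translated into one language), flag pairs whose similarity falls below a threshold for human review. The similarity measure, threshold, and example answers are all illustrative assumptions.

```python
# Minimal sketch of data-level discrepancy flagging (illustrative, not MIND):
# compare aligned answer pairs by token overlap and flag divergent ones.

def jaccard(a: str, b: str) -> float:
    """Token-level Jaccard similarity between two answers."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    return len(ta & tb) / len(ta | tb) if ta | tb else 1.0

def flag_discrepancies(pairs, threshold=0.6):
    """Return indices of answer pairs that diverge enough to need review."""
    return [i for i, (a, b) in enumerate(pairs) if jaccard(a, b) < threshold]

pairs = [
    # Objective question: answers agree across languages.
    ("Jaundice is yellowing of the skin caused by excess bilirubin.",
     "Jaundice is yellowing of the skin caused by excess bilirubin."),
    # Culturally variable question: answers diverge by region.
    ("A midwife usually assists in childbirth.",
     "A doctor or obstetrician usually assists in childbirth."),
]
print(flag_discrepancies(pairs))  # [1]
```

A real pipeline would use cross-lingual embeddings or entailment rather than token overlap, but the user-in-the-loop shape is the same: automatic flagging, human adjudication.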


Beyond Postconditions: Can Large Language Models infer Formal Contracts for Automatic Software Verification?

Richter, Cedric, Wehrheim, Heike

arXiv.org Artificial Intelligence

Automatic software verifiers have become increasingly effective at the task of checking software against (formal) specifications. Yet, their adoption in practice has been hampered by the lack of such specifications in real-world code. Large Language Models (LLMs) have shown promise in inferring formal postconditions from natural language hints embedded in code such as function names, comments or documentation. Using the generated postconditions as specifications in a subsequent verification, however, often leads verifiers to suggest invalid inputs, hinting at potential issues that ultimately turn out to be false alarms. To address this, we revisit the problem of specification inference from natural language in the context of automatic software verification. In the process, we introduce NL2Contract, the task of employing LLMs to translate informal natural language into formal functional contracts, consisting of postconditions as well as preconditions. We introduce metrics to validate and compare different NL2Contract approaches, using soundness, the bug-discriminative power of the generated contracts, and their usability in the context of automatic software verification as key metrics. We evaluate NL2Contract with different LLMs and compare it to the task of postcondition generation (nl2postcond). Our evaluation shows that (1) LLMs are generally effective at generating functional contracts that are sound for all possible inputs, (2) the generated contracts are sufficiently expressive for discriminating buggy from correct behavior, and (3) verifiers supplied with LLM-inferred functional contracts produce fewer false alarms than when provided with postconditions alone. Further investigations show that LLM-inferred preconditions generally align well with developers' intentions, which allows us to use automatic software verifiers to catch real-world bugs.
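What a functional contract adds over a bare postcondition can be shown with a small hedged sketch; the function, the contract predicates, and the checking wrapper below are illustrative assumptions, not output of the authors' pipeline. The precondition rules out inputs the function was never meant to handle, which is exactly what suppresses the false alarms a postcondition alone would produce.

```python
# Illustrative functional contract (precondition + postcondition) for a
# hypothetical integer-square-root function.

def integer_sqrt(n: int) -> int:
    """Largest r with r*r <= n."""
    r = 0
    while (r + 1) * (r + 1) <= n:
        r += 1
    return r

def pre(n) -> bool:
    # Precondition inferred from the name/docstring: a non-negative integer.
    return isinstance(n, int) and n >= 0

def post(n: int, r: int) -> bool:
    # Postcondition: r is the floor of the square root of n.
    return r * r <= n < (r + 1) * (r + 1)

def verified(n):
    assert pre(n), "precondition violated: input outside the contract"
    r = integer_sqrt(n)
    assert post(n, r), "postcondition violated: implementation bug"
    return r

print(verified(10))  # 3
```

With only the postcondition, a verifier could report that `integer_sqrt(-1)` returns 0 yet `post(-1, 0)` fails, a spurious "bug"; the precondition marks that input as out of scope.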


WebThinker: Empowering Large Reasoning Models with Deep Research Capability

Li, Xiaoxi, Jin, Jiajie, Dong, Guanting, Qian, Hongjin, Wu, Yongkang, Wen, Ji-Rong, Zhu, Yutao, Dou, Zhicheng

arXiv.org Artificial Intelligence

Large reasoning models (LRMs), such as OpenAI-o1 and DeepSeek-R1, demonstrate impressive long-horizon reasoning capabilities. However, their reliance on static internal knowledge limits their performance on complex, knowledge-intensive tasks and hinders their ability to produce comprehensive research reports requiring synthesis of diverse web information. To address this, we propose WebThinker, a deep research agent that empowers LRMs to autonomously search the web, navigate among web pages, and draft reports during the reasoning process. WebThinker integrates a Deep Web Explorer module, enabling LRMs to dynamically search, navigate, and extract information from the web when encountering knowledge gaps. It also employs an Autonomous Think-Search-and-Draft strategy, allowing the model to seamlessly interleave reasoning, information gathering, and report writing in real time. To further enhance research tool utilization, we introduce an RL-based training strategy via iterative online Direct Preference Optimization (DPO). Extensive experiments on complex reasoning benchmarks (GPQA, GAIA, WebWalkerQA, HLE) and scientific report generation tasks (Glaive) demonstrate that WebThinker significantly outperforms existing methods and strong proprietary systems. Our approach enhances LRM reliability and applicability in complex scenarios, paving the way for more capable and versatile deep research systems. The code is available at https://github.com/RUC-NLPIR/WebThinker.
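The interleaved reasoning-retrieval-drafting control flow described above can be caricatured in a few lines; the loop, the stub `model`, and the stub `search` below are placeholders of my own, not the WebThinker implementation.

```python
# Toy think-search-draft loop (illustrative only): the model either asks for
# a search to fill a knowledge gap or drafts the final text.

def run_agent(question, model, search, max_steps=5):
    """Alternate reasoning with retrieval until the model emits a draft."""
    context = [question]
    for _ in range(max_steps):
        action, payload = model(context)  # ("search", query) or ("draft", text)
        if action == "draft":
            return payload                # final report text
        context.append(search(payload))   # add retrieved evidence, continue
    return None                           # step budget exhausted

# Stub components standing in for the LRM and the web explorer.
def toy_model(context):
    if len(context) == 1:
        return ("search", context[0])
    return ("draft", "Answer based on: " + context[-1])

def toy_search(query):
    return "results for " + query

print(run_agent("capital of France?", toy_model, toy_search))
```

The real system folds this loop into the model's own reasoning trace and trains the tool-use policy with iterative online DPO; the sketch only shows the control-flow skeleton.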


Learning to Guarantee Type Correctness in Code Generation through Type-Guided Program Synthesis

Huang, Zhechong, Zhang, Zhao, Ji, Ruyi, Xia, Tingxuan, Zhu, Qihao, Cao, Qinxiang, Sun, Zeyu, Xiong, Yingfei

arXiv.org Artificial Intelligence

Language models have shown remarkable proficiency in code generation; nevertheless, ensuring type correctness remains a challenge. Although traditional methods, such as constrained decoding, alleviate this problem by externally rejecting untypable code, the model itself does not effectively learn type reasoning internally, which ultimately limits its overall performance. This paper introduces TyFlow, a novel system that internalizes type reasoning within code generation to guide the model to learn the type system. The core of our approach is a novel type-guided program synthesis system that maintains an isomorphism between type derivation trees and synthesis derivation trees, enabling a new code representation based on synthesis decision sequences rather than traditional text-based token sequences. By offloading the complexity of type system learning to the representation itself, models can redirect their computational resources toward higher-level program semantics. Our evaluation shows that TyFlow not only eliminates type errors but also significantly improves functional correctness, highlighting the importance of aligning LMs with type systems internally.
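The key representational move, emitting synthesis decisions instead of free-form tokens, can be sketched in miniature; the toy grammar, production names, and decision encoding below are my own illustrative assumptions, not TyFlow's. Because each decision can only select among productions of the current goal type, any (sufficiently long) decision sequence derives a well-typed expression by construction.

```python
# Toy type-guided synthesis (illustrative): the "model" outputs decision
# indices; each index is interpreted relative to the productions available
# for the current goal type, so untypable programs are unrepresentable.

PRODUCTIONS = {
    "int":  [("lit1", lambda d: "1"),
             ("plus", lambda d: f"({d('int')} + {d('int')})")],
    "bool": [("true", lambda d: "True"),
             ("lt",   lambda d: f"({d('int')} < {d('int')})")],
}

def synthesize(goal: str, decisions: list) -> str:
    """Expand the goal type by consuming decision indices left to right."""
    it = iter(decisions)
    def derive(ty: str) -> str:
        options = PRODUCTIONS[ty]            # only type-compatible choices
        _, build = options[next(it) % len(options)]
        return build(derive)
    return derive(goal)

print(synthesize("bool", [1, 1, 0, 0, 0]))  # ((1 + 1) < 1)
```

A text-token model could emit `(1 < True)` and rely on an external checker to reject it; here the decision sequence for goal type `bool` cannot express that term at all, which is the isomorphism between type derivation and synthesis derivation in miniature.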